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VOICE AUTHENTICATION DEVICE, VOICE AUTHENTICATION SYSTEM, 
AND VOICE AUTHENTICATION 'METHOD 

Technical Field 



authentication apparatus, voice authentication system 
and voice authentication method for recognizing a 
speaker's voice and authenticating the speaker. 

Background Art 

[0002] Conventionally, as a technology for identifying 
the communicating party with a mobile telephone, there 
is, for example, a technology which uses an application 
installed in a portable terminal that displays the 
communicating party' s telephone number when an incoming 
call arrives and, if it is a registered telephone number, 
displays the communicating party' s name or specific image 
or plays a specific ringing melody. 

[0003] Furthermore, as a conventional technology for 
personal authentication of the communicating party, there 
is, for example, a personal authentication system and 
personal authentication apparatus in mobile 
communication disclosed in Patent Document 1. With this 
system and apparatus, every time personal authentication 
is performed when a server at a control center provides 



[0001] 



The present invention relates to a voice 



2F04051-PCT 2 

service to a portable terminal, the server transmits 
predetermined text information to the portable terminal, 
and personal authentication is performed with voice 
generated by a user of the portable terminal according 
5 to this text information so as to prevent spoofing by 
a third party. 

Patent Document 1: Japanese Patent Application 

Laid-open No . 2 0 0 3 - 8 7 8 6 0 

10 

Disclosure of Invention 

Problems to be Solved by the Invention 

[0004] However, in the case of identifying the 
communicating party using the conventional application, 

15 the identification of the communicating party relies on 
a display of the communicating party' s telephone number 
and the user's recognition of the communicating party's 
voice, and therefore there is a problem that even a call 
from an unknown telephone number (not stored in the mobile 

20 telephone) must be judged based on only voice if the caller 
refers to himself /herself as a family member or 
acquaintance, and the user may suffer damage from an 
ill-intentioned person. 

[0005] Furthermore, in the case of a personal 

25 authentication system and personal authentication 
apparatus in mobile communication, personal 
authentication is performed by only verifying the voice 
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generated by the portable terminal user according to 
predetermined text information transmitted from the 
server against the registered user's voice every time 
personal authentication is performed^ and therefore 
5 personal authentication is possible only between the 
server and portable terminal , and personal authentication 
of the communicating party is not possible between 
communication terminals, and therefore it is not possible 
to prevent communication with ill-intentioned people. 

10 [0006] Itisan ob j ect of the present invention to provide 
a voice authentication apparatus, voice authentication 
system and voice authentication method capable of 
sampling arbitrary voice of the communicating party of 
the incoming call, performing personal identification 

15 through personal authentication and reliably recognizing 
whether the communicating party of the incoming call is 
the person himself /herself or not and thus preventing 
communication with ill-intentioned people, 

20 Means for Solving the Problem 

[0007 ] The voice authentication apparatus of the present 
invention is a voice authentication apparatus mounted 
on a communication terminal apparatus, having a storage 
section that stores a voice sample for each of a plurality 

25 of communicating parties, a specification section that 
specifies the communicating party of an incoming call, 
a verification section that reads the voice sample of 
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the communicating party specified by the specification 
section from the storage section, verifies the received 
voice of the communicating party against the voice sample 
and outputs the verification result and a reporting 
5 section that reports the verification result outputted 
from the verification section. 

Advantageous Effect of the Invention 

[0008] According to the present invention, it is possible 
10 to reliably perform personal identification between 
communication terminal apparatus based on a communicating 
party's voice of an incoming call and prevent 
communication with ill-intentioned people, 

15 Brief Description of Drawings 
[0009] 

FIG.l illustrates the configuration of a mobile 
telephone system according to Embodiment 1 of the present 
invention ; 

20 FIG. 2 is a block diagram showing the internal 

configuration of a mobile telephone apparatus according 
to this Embodiment 1; 

FIG. 3 illustrates the configuration of a mobile 
telephone system according to Embodiment 2 of the present 
25 invention; 

FIG- 4 is a block diagram showing the internal 
configuration of the mobile telephone apparatus and 
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remote terminal apparatus according to this Embodiment 
2; 

FIG. 5 illustrates the configuration of a mobile 
telephone system according to Embodiment 3 of the present 
5 invention; 

FIG. 6 is a block diagram showing the internal 
configuration of the mobile telephone apparatus and 
remote terminal apparatus according to this Embodiment 
3; 

10 FIG. 7 shows the configuration of a mobile telephone 

system according to Embodiment 4 of the present invention; 
and 

FIG. 8 is a block diagram showing the internal 
configuration of the mobile telephone apparatus and 
15 remote terminal apparatus according to this Embodiment 
4 . 



Best Mode for Carrying Out the Invention 
[0010] Now, embodiments of the present invention will 
20 be described below in detail with reference to the 
accompanying drawings . 

[0011] (Embodiment 1) 

This Embodiment 1 will explain a case where the 
25 communication terminal apparatus of the present invention 
is applied to a mobile telephone apparatus. FIG.l 
illustrates the configuration of a mobile telephone 
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system according to this Embodiment 1, and FIG. 2 is a 
block diagram showing a voice authentication function 
mounted on a mobile telephone apparatus. 
[0012] The mobile telephone system in FIG.l shows a case 
5 where mobile telephone apparatus 100 of user A on the 
transmitting side and mobile telephone apparatus 200 of 
user B on the receiving side are connected via network 
N. In this case, the configuration of a radio base station 
or the like necessary for the mobile telephone system 

10 is omitted. 

[0013] FIG. 2 is a block diagram mainly showing the voice 
authentication function which is mounted on mobile 
telephone apparatus 200 and mainly configured with user 
specification input section 201, voice data sample 

15 storage section 202, voice verification section 203, 
display section 204 and communication section 205. 
[0014] User specification input section 201 is provided 
with an operation key for user B to specify the 
communicating party (user A in this case) when a 

20 conversation starts after a call arrives, and outputs 
a user specif ication signal (including, for example, name 
and registered number registered in a telephone 
directory) to voice data s ample storage section 202 . User 
specification may also be inputted with voice in addition 

25 to the operation key. 

[0015] Voice data sample storage section 202 is a memory 
that stores voice sample data of a plurality of 
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communicating parties in association with identification 
information about the communicating parties (e.g. , names 
and registered numbers registered in a telephone 
directory) and in this case, suppose the voice sample 
5 data ofuserA is storedbeforehand. Timing at which voice 
sample data is stored may be arbitrary timing according 
to a storage instruction by user B. By making it possible 
to specify the timing for storing voice sample data 
arbitrarily, it is possible to store arbitrary voice as 

10 voice sample data without causing the communicating party 
to take time and trouble registering voice samples . Voice 
data sample storage section 202 reads corresponding voice 
sample data according to the user specification signal 
(including, for example, name and registered number 

15 registered in a telephone directory) inputted from user 
specification input section 201 and outputs the voice 
sample data to voice verification section 203 as the 
corresponding sample signal. 

[0016] Voice verification section 203 verifies a 
20 received voice signal inputted from communication section 
205 against the corresponding sample signal inputted from 
voice data sample storage section 202 and outputs the 
verification result as a verification result signal to 
display section 204. Voice verification section 203 
25 verifies the waveform characteristic or the like of the 
received voice signal against the corresponding sample 
signal and outputs a verification result signal (e.g.. 
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OK, NG) which indicates whether the communicating party 
is the person himself /herself or not according to the 
degree of similarity or the like. 

[0017] Display section 204 is configured with a liquid 
5 crystal display panel or the like, displays ^'OK'' or ^'NG" 
as the authentication result fromthe verification result 
signal (e.g., OK, NG) inputted from voice verification 
section 203 and reports to user B whether the communicating 
party is the person himself /herself or not. 

10 [0018] Communication section 205 has a communication 
function that supports a mobile telephone system, carries 
out transmission/reception of communication commands 
that support the mobile telephone system and 
transmission/reception of voice signals to/from mobile 

15 telephone apparatus 100 via network N and outputs the 
received voice signal to voice verification section 203. 
[0019] Next, the operation of mobile telephone apparatus 
200 will be explained with reference to FIG.l and FIG. 2. 
[0020] User A transmits a signal from mobile telephone 

20 apparatus 100 to mobile telephone apparatus 200 of user 
B, the signal arrives at mobile telephone apparatus 200 
of user B via network N, user B performs an off-hook 
operation according to a report of the incoming call, 
and then a conversation with user A starts . At this time , 

25 immediately after starting the conversation, when user 
B hears the voice of user A and specifies user A through 
key operation of user specification input section 201, 
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a user specification signal specifying user A (including, 
for example, name and registered number registered in 
a telephone directory) is outputted to voice data sample 
storage section 202 . Furthermore, communication section 
5 205 outputs the received voice signal received frommobile 
telephone apparatus 100 of user A to voice verification 
section 2 0 3. 

[0021] Voice data sample storage section 202 reads voice 
sample data corresponding to user A according to the user 

10 specification signal (including, for example, name and 
registered number registered in a telephone directory) 
inputted from user specification input section 201 and 
outputs the voice sample data as the corresponding sample 
signal to voice verification section 203. Voice 

15 verification section 203 verifies the received voice 
signal inputted from communication section 205 against 
the corresponding sample signal of user A inputted from 
voice data sample storage section 202 and outputs the 
verification result as a verification result signal to 

20 display section 204. 

[0022] Display section 204 displays ^^OK" or "^NG" as the 
authentication result from the verification result signal 
(e.g., OK, NG) inputted from voice verification section 
203 and reports to user B whether user A who is the 

25 communicating party is the person himself /herself or not. 
As a result, user B can immediately decide whether to 
continue or discontinue the conversation from the 
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authentication result displayed on display section 204, 
[0023] As described above, since mobile telephone 
apparatus 200 of this Embodiment 1 is provided with a 
voice authentication function for authenticating whether 
5 the communicating party is the person himsel f /her sel f 
or not based on the communicating party's voice, it is 
possible to reliably perform personal authentication of 
the communicating party in a conversation between mobile 
telephone apparatus, prevent communication with 
10 ill-intentioned people and reliably prevent damages by 
spoofing or the like, 

[0024] In addition, the above described embodiment has 
explained the case where mobile telephone apparatus 200 
is provided with the voice authentication function, but 

15 mobile telephone apparatus 100 may also be provided with 
that function. Furthermore, this function may be 
provided not only for a mobile telephone apparatus but 
also for a fixed-line phone apparatus and other 
communication terminal apparatus having a telephone 

20 function. 

[0025] On the other hand, when the voice verification 
result is ^^NG", information about user A (including, for 
example, name and telephone number registered in a 
telephone directory) and alarm information including the 
25 fact that user A is not the person himsel f /her sel f may 
be automatically reported to an external security 
organization such as the police or a security company 
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registered in mobile telephone apparatus 200 using the 
voice verification result ''NG'' as a trigger. In this case, 
it is possible to make use of the voice authentication 
function applied to the mobile telephone apparatus for 
5 crime prevention and improve the reliability of the mobile 
telephone apparatus. Furthermore, when the voice 
verification result is ^^NG", if the conversation with 
the communicating party is automatically recorded, it 
is possible to easily use the recorded content as evidence 

10 when the conversation is a criminal act. 
[0026] (Embodiment 2) 

This Embodiment 2 will explain a case where the voice 
authentication system of the present invention is applied 
to a mobile telephone system. FIG. 3 shows the system 

15 configuration of a mobile telephone system according to 
Embodiment 2 of the present invention, and FIG. 4 is a 
block diagram showing the voice authentication function 
mounted on a mobile telephone apparatus and remote 
terminal apparatus . 

20 [0027] The mobile telephone system in FIG. 3 shows a case 
where mobile telephone apparatus 100 of user A on the 
transmitting side, mobile telephone apparatus 300 of user 
B on the receiving side and remote terminal apparatus 
400 are connected via network N. In this case, the 

25 configuration of a radio base station or the like necessary 
for the mobile telephone system is omitted. 
[0028] FIG. 4 is a block diagram mainly showing the voice 
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authentication function mounted on mobile telephone 
apparatus 300 and remote terminal apparatus 400, and the 
same components as those in mobile telephone apparatus 
200 shown in FIG. 2 are assigned the same reference numerals 
5 and explanations thereof will be omitted. Mobile 
telephone apparatus 300 is mainly configured with user 
specification input section 201, transmission section 
301, reception section 302 and display section 204. 
Remote terminal apparatus 400 is mainly configured with 
10 reception section 401, voice data sample storage section 
402, voice verification section 403 and transmission 
section 404 . In addition, in mobile telephone apparatus 

300 and remote terminal apparatus 400, when communication 
is performed between mobile telephone apparatus 300 and 

15 remote terminal apparatus 400, suppose that a key for 
encrypting and decrypting information to be 
transmitted/received is stored in transmission section 

301 and reception section 302, and reception section 401 
and transmission section 404, respectively. 

20 [0029] Transmission section 301 encrypts a received 
voice signal receivedby reception section 302 frommobile 
telephone apparatus 100 of user A and a user specification 
signal (including, for example, name and registered 
number registered in a telephone directory) inputted from 

25 user specification input section 201 using the above 
described key and transmits the encrypted signals as an 
encrypted file to remote terminal apparatus 400 via 
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network N . 

[0030] Reception section 302 receives the voice signal 
transmitted from mobile telephone apparatus 100 of user 
A via network N, outputs the received voice signal to 
5 transmission section 301, also receives the encrypted 
file (including the verification result) transmitted from 
remote terminal apparatus 400 via network decrypts 
the verification result from the encrypted file using 
the above described key and outputs the decrypted 

10 verification result to display section 204. 

[0031] Next, the configuration of remote terminal 
apparatus 400 will be explained. Reception section 401 
receives the encrypted file (including the received voice 
signal and user specification signal) transmitted from 

15 mobile telephone apparatus 300 of user B via network N, 
decrypts the received voice signal and user specification 
signal from the encrypted file using the above described 
key, and outputs the user specification signal to voice 
data sample storage section 402 and outputs the received 

20 voice signal to voice verification section 403. 

[0032] Voice data sample storage section 402 is a memory 
that stores voice sample data of a plurality of 
communicating parties of user B in association with 
identification information (e.g., names and registered 

25 numbers registered in a telephone directory) of the 
communicating parties, and in this case suppose that the 
voice sample data of user A is stored beforehand. The 
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timing at which voice sample data is stored may be arbitrary 
timing according to a storage instruction by user B. By 
making it possible to specify the timing for storing voice 
sample data arbitrarily, it is possible to store arbitrary 
5 voice as voice sample data without causing the 
communicating party to take time and trouble registering 
the voice samples. When the user specification signal 
(including, for example, name and registered number 
registered in a telephone directory) is inputted from 

10 reception section 401, voice data sample storage section 
402 reads the corresponding voice sample data and outputs 
the corresponding voice sample data as the corresponding 
sample signal to voice verification section 403. 
[0033] Voice verification section 403 verifies the 

15 received voice signal inputted from reception section 
401 against the corresponding sample signal inputted from 
voice data sample storage section 402 and outputs the 
verification result as a verification result signal to 
transmission section 404. Voice verification section 

20 403 verifies the waveform characteristic or the like of 
the received voice signal against the corresponding 
sample signal and outputs a verification result signal 
(e.g., OK, NG) indicating whether the communicating party 
is the person himsel f /her sel f or not based on the degree 

25 of similarity or the like. 

[0034] Transmission section 404 encrypts the 

verification result signal inputted from voice 
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verification section 403 using the above described key 
and transmits the encrypted signal as an encrypted file 
to mobile telephone apparatus 300 via network N. 
[0035] Next, the operations of mobile telephone 
5 apparatus 300 and remote terminal apparatus 400 will be 
explained with reference to FIG. 3 and FIG. 4. 
[0036] User A transmits a signal from mobile telephone 
apparatus 100 to mobile telephone apparatus 300 of user 
B, the signal arrives at mobile telephone apparatus 300 

10 of user B via network user B performs an off-hook 

operation according to a report of the incoming call, 
and then a conversation with user A starts . At this time, 
immediately after starting the conversation, when user 
B hears the voice of user A and specifies user A through 

15 key operation of user specification input section 201, 
a user specification signal that specifies user A 
(including, for example, name and registered number 
registered in a telephone directory) is outputted to 
transmission section 301. Furthermore, when a user 

20 specification signal is inputted from user specification 
input section 201 and a received voice signal received 
by reception section 302 from mobile telephone apparatus 
100 of user A is inputted, transmission section 301 
encrypts the user specification signal and received voice 

25 signal using the above described key and transmits the 
encrypted signals as an encrypted file to remote terminal 
apparatus 400 via network N. 
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[0037] Reception section 401 of remote terminal 
apparatus 400 receives the encrypted file (including the 
received voice signal and user specification signal) 
transmitted from mobile telephone apparatus 300 of user 
5 B via network decrypts the received voice signal and 
user specification signal from the encrypted file using 
the above described key, outputs the user specification 
signal to voice data sample storage section 402 and also 
outputs the received voice signal to voice verification 

10 section 403. 

[0038] Voice data sample storage section 402 reads the 
voice sample data corresponding to user A according to 
the user specification signal (including, for example, 
name and registered number registered in a telephone 

15 directory) inputted from reception section 401 and 
outputs the voice sample data as a corresponding sample 
signal to voice verification section 403. Voice 
verification section 403 verifies the received voice 
signal inputted from reception section 401 against the 

20 corresponding sample signal of user A inputted from voice 
data sample storage section 402 and outputs the 
verification result as a verification result signal to 
transmission section 404. 

[0039] Transmission section 404 encrypts the 

25 verification result signal inputted from voice 
verification section 403 using the above described key 
and transmits the encrypted signal as an encrypted file 
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to mobile telephone apparatus 300 via network N. 
Reception section 302 of mobile telephone apparatus 300 
receives the encrypted fil^ transmitted from remote 
terminal apparatus 400 via network N, decrypts the 
5 verification result signal from the encrypted file using 
the above described key and outputs the verification 
result signal to display section 204. 

[0040] Display section 204 displays ^"OK" or "^NG" from 
the verification result signal (e.g., OK, NG) inputted 

10 from reception section 302 as an authentication result 
and reports to user B whether user A who is the 
communicating party is the person himself /herself or not. 
As a result, user B can immediately decide whether to 
continue or discontinue the conversation from the 

15 authentication result displayed on display section 204. 
[0041] As described above , in the mobi le telephone system 
of this Embodiment 2, remote terminal apparatus 400 is 
provided with the voice authentication function for 
authenticating whether the communicating party is the 

20 person himself /herself or not based on the communicating 
party's voice, and therefore it is possible to reliably 
perform personal authentication of the communicating 
party in a conversation between mobile telephone 
apparatus, prevent communication with ill-intentioned 

25 people and reliably prevent damages by spoofing or the 
like . 

[0042] Furthermore, when communication between mobile 
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telephone apparatus 300 and remote terminal apparatus 
400 is performed/ information is encrypted when 
transmitted/received, and therefore it is possible to 
prevent the information content from leaking out to a 
5 third party and improve reliability of the mobile 
telephone system. Furthermore, when the voice 

verification result is ^^NG'', information about user A 
(including, for example, name and telephone number 
registered in a telephone directory) and alarm 

10 information including the fact that user A is not the 
person himself /herself may be automatically reported to 
an external security organization such as the police or 
a security company registered in mobile telephone 
apparatus 400 using the voice verification result ^^NG" 

15 as a trigger. In this case, it is possible to make use 
of the mobile telephone system for crime prevention and 
further improve the reliability of the mobile telephone 
system. Furthermore, when the voice verification result 
is ^^NG", if the conversation with the communicating party 

20 is automatically recorded in the mobile telephone 
apparatus, it is possible to easily use the recorded 
content as evidence when the conversation is a criminal 
act - 

[0043] Furthermore, since the voice samples and voice 
25 verification function are provided on the remote terminal 
apparatus side, it is possible to facilitate upgrade of 
the voice verification function and storage of the voice 
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sample, and also maintain reduction in size and weight 
of the mobile telephone apparatus without substantially 
using resources of the mobile telephone apparatus. 
Furthermore, the remote terminal apparatus may be 
5 provided by user B himself /herself or already provided 
by another person or company or the like. For example, 
when user A refers to himself /herself as an ^^employee 
of company C", user B can. also make a remote terminal 
apparatus of company C perform voice verification. In 

10 this way, even in the case of a conversation with an unknown 
communicating party (with no voice sample data), the 
identity of user A is guaranteed by company C and it is 
possible to provide a voice authentication service 
between mobile telephone apparatus not only between 

15 individuals but also between employees of a company or 
between a company and individuals . 
[0044] (Embodiment 3) 

This Embodiment 3 will explain a case where the voice 
authentication system using the present invention is 

20 applied to a mobile telephone system. FIG. 5 shows the 
system configuration of a mobile telephone system 
according to this Embodiment 3 , and FIG.6isablock diagram 
showing a voice authentication function mounted on a 
mobile telephone apparatus and remote terminal apparatus . 

25 [0045] The mobile telephone system in FIG. 5 shows a case 
where mobile telephone apparatus 100 of user A on the 
transmitting side, mobile telephone apparatus 500 of user 
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B on the receiving side and remote terminal apparatus 
600 are connected via network N. In this case, the 
configuration of a radio base station or the like necessary 
for the mobile telephone system is omitted. 
5 [0046] FIG. 6 is a block diagram mainly showing the voice 
authentication function mounted on mobile telephone 
apparatus 500 and remote terminal apparatus 600 and the 
same components as those of mobile telephone apparatus 
200, 400 shown in FIG. 2 and FIG. 4 are assigned the same 

10 reference numerals and explanations thereof will be 
omitted. Mobile telephone apparatus 500 is mainly 
configured with user specification input section 201, 
voice data sample storage section 202, transmission 
section 501, reception section 302 and display section 

15 204. Remote terminal apparatus 600 is mainly configured 
with reception section 601, voice verification section 
602 and transmission section 603. In addition, in mobile 
telephone apparatus 500 and remote terminal apparatus 
600, when communication is performed between mobile 

20 telephone apparatus 500 and remote terminal apparatus 
600, suppose that a key for encrypting and decrypting 
information to be transmitted/received is stored in 
transmission section 501 and reception section 302, and 
reception section 601 and transmission section 603 

25 respectively. 

[0047] Whena received voice signal received by reception 
section 302 from mobile telephone apparatus 100 of user 
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A is inputted and a corresponding sample signal of the 
voice of user A is inputted from voice data sample storage 
section 202 , transmission section 501 of mobile telephone 
apparatus 500 encrypts' the signals using the above 
5 described key and transmits the encrypted signals as an 
encrypted file to remote terminal apparatus 600 via 
network N. 

[0048] Reception section 601 of remote terminal 
apparatus 600 receives the encrypted file (including the 

10 received voice signal and corresponding sample signal) 
transmitted from mobile telephone apparatus 500 of user 
B via network decrypts the received voice signal and 
corresponding sample signal from the encrypted file using 
the above described key and outputs the decrypted signals 

15 to voice verification section 602 . Voice verification 
section 602 verifies the received voice signal against 
the corresponding sample signal inputted from reception 
section 601 and outputs the verification result as a 
verification result signal to transmission section 603. 

20 Voice verification section 602 verifies the waveform 
characteristic or the like of the received voice signal 
against the corresponding sample signal and outputs a 
verification result signal (e.g., OK, NG) indicating 
whether the communicating party is the person 

25 himself /herself or not according to the degree of 
similarity to transmission section 603. Transmission 
section 603 encrypts the verification result signal 
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inputted from voice verification section 602 using the 
above described key and transmits the encrypted signal 
as an encrypted file to^mobile telephone apparatus 500 
via network N. 

5 [0049] Inthe above described configuration of the mobile 
telephone system, remote terminal apparatus 600 is 
provided with the voice verification function for 
verifying the voice of the communicating party, and 
therefore it is possible to reliably perform personal 

10 authentication of the communicating party in a 
conversation between mobile telephone apparatus and 
prevent communication with ill-intentioned people and 
reliably prevent damages by spoofing or the like. 
Furthermore, the voice verification function is provided 

15 on the remote terminal apparatus side, and therefore it 
is possible to allow a plurality of mobile telephone 
apparatus users to share a voice verification application 
and facilitate upgrade or change of the voice verification 
function . 

20 [0050] Furthermore, when communication is performed 
between mobile telephone apparatus 500 and remote 
terminal apparatus 600, information is encrypted when 
transmitted/received, and therefore it is possible to 
prevent the information content from leaking out to a 

25 third party and improve reliability of the mobile 
telephone system. Furthermore, when the voice 

verification result is ^^NG", information about user A 
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(including, for example, name and telephone number 
registered in a telephone directory) and alarm 
information including the fact that user A is not the 
person himself /herself may be automatically reported to 
5 an external security organization such as the police or 
a security company registered in remote terminal 
apparatus 600 using the voice verification result ^'NG" 
as a trigger. In this case, it is possible to make use 
of the mobile telephone system for crime prevention and 

10 further improve the reliability of the mobile telephone 
system. Furthermore, when the voice verification result 
is ^^NG", if the conversation with the communicating party 
is automatically recorded in the mobile telephone 
apparatus, it is possible to easily use the recorded 

15 content as evidence when the conversation is a criminal 
act . 

[0051] (Embodiment 4) 

This Embodiment 4 will explain a case where the voice 
authentication system of the present invention is applied 

20 to a mobile telephone system. FIG. 7 shows the system 
configuration of a mobile telephone system according to 
this Embodiment 4, and FIG. 8 is a block diagram showing 
a voice authentication function mounted on a mobile 
telephone apparatus and remote terminal apparatus. 

25 [0052] The mobile telephone system in FIG. 7 shows a case 
where mobile telephone apparatus 100 of user A on the 
transmitting side, mobile telephone apparatus 700 of user 
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B on the receiving side and remote terminal apparatus 
800 are connected via network N. In this case, the 
configuration of a radio base station or the like necessary 
for the mobile telephone system is omitted. 
5 [0053] FIG-7 is a block diagram mainly showing the voice 
authentication function mounted on mobile telephone 
apparatus 700 and remote terminal apparatus 800, and the 
same components as those of mobile telephone apparatus 
200, 400 shown in FIG. 2 and FIG. 4 are assigned the same 

10 reference numerals and explanations thereof will be 
omitted. Mobile telephone apparatus 700 is mainly 
configured with user specification input section 201, 
voice verification section 2 03, trans mi ssion section 701, 
reception section 702 and display section 204. Remote 

15 terminal apparatus 800 is mainly configured with 
reception section 801, voice data sample storage section 
802, transmission section 803. In addition, in mobile 
telephone apparatus 700 and remote terminal apparatus 
800, when communication is performed between mobile 

20 telephone apparatus 700 and remote terminal apparatus 
800, suppose that a key for encrypting and decrypting 
information to be transmitted/received is stored in 
transmission section 701 and reception section 702, and 
reception section 801 and transmission section 803, 

25 respectively. 

[0054] Transmission section 701 of mobile telephone 
apparatus 700 encrypts a user specification signal 
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inputted from user specification input section 201 using 

the above described key and transmits the encrypted signal 
as an encrypted file to remote terminal apparatus 800 
via network N. 

5 [0055] Reception section 801 of remote terminal 
apparatus 800 receives the encrypted file (including the 
user specification signal) transmitted from mobile 
telephone apparatus 700 of user B via network N, decrypts 
the user specification signal from the encrypted file 

10 using the above described key and outputs the decrypted 
signal to voice data sample storage section 802. Voice 
data sample storage section 802 reads the voice sample 
data corresponding to user A according to the user 
specification signal (including, for example, name and 

15 registered number registered in a telephone directory) 
inputted from reception section 801 and outputs the voice 
data sample to transmission section 803 as a corresponding 
sample signal. Transmission section 803 encrypts the 
corresponding sample signal inputted from voice data 

20 sample storage section 802 using the above described key 
and transmits the encrypted signal as an encrypted file 
to mobile telephone apparatus 700 via network N. 
[0056] Reception section 702 of mobile telephone 
apparatus 700 receives the voice signal transmitted from 

25 mobile telephone apparatus 100 of user A via network N 
and outputs the received voice signal to voice 
verification section 203. Furthermore, reception 
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section 702 receives the encrypted file (including the 
corresponding sample signal of user A) transmitted from 
remote terminal apparatus 800 via network N, decrypts 
the corresponding sample signal from the encrypted file 
5 using the above described key and outputs the decrypted 
signal to voice verification section 203. 
[0057] In the above described configuration of the mobile 
telephone system, remote terminal apparatus 800 is 
provided with the voice data sample storage function for 

10 storing a voice sample of the communicating party^ and 
mobile telephone apparatus 700 is provided with the voice 
recognition function for recognizing whether the voice 
of the communicating party is the voice of the person 
himself /herself or not, and therefore it is possible to 

15 reliably perform personal authentication of the 
communicating party in a conversation between the mobile 
telephone apparatus and prevent communication with 
ill-intentioned people and reliably prevent damages by 
spoofing or the like. 

20 [0058] Furthermore, since the voice data sample storage 
function is provided on the remote terminal apparatus 
side, it is possible to allow a plurality of mobile 
telephone apparatus users to share the voice sample 
storage function, and thereby making it possible to omit 

25 a memory for storing the voice sample on the mobile 
telephone apparatus side and reduce the memory capacity 
mounted on the mobile telephone apparatus . 
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[0059] Furthermore, when communication is performed 
between mobile telephone apparatus 700 and remote 
terminal apparatus 800, information is encrypted when 
transmitted/received, and therefore it is possible to 
5 prevent the information content from leaking out to a 
third party and improve reliability of the mobile 
telephone system. Furthermore, when the voice 

verification result is ^^NG", information about user A 
(including, for example, name and telephone number 

10 registered in a telephone directory) and alarm 
information including the fact that user A is not the 
person himself /herself may be automatically reported to 
an external security organization such as the police or 
a security company registered in remote terminal 

15 apparatus 800 using the voice verification result ^^NG" 
as a trigger. In this case, it is possible to make use 
of the mobile telephone system for crime prevention and 
further improve the reliability of the mobile telephone 
system. Furthermore, when the voice verification result 

20 is ^^NG", if the conversation with the communicating party 
is automatically recorded in the mobile telephone 
apparatus, it is possible to easily use the recorded 
content as evidence when the conversation is a criminal 
act . 

25 [0060] A first aspect of the voice authentication 
apparatus of the present invention is a voice 
authentication apparatus mounted on a communication 
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terminal apparatus having a storage section that stores 
a voice sample for each of a plurality of communicating 
parties, a specification section that specifies the 
communicating party of an incoming call, a verification 
5 section that reads the voice sample of the communicating 
party specified by the specification section from the 
storage section, verifies a received voice of the 
communicating party against the voice sample and outputs 
the verification result and a reporting section that 
10 reports the verification result outputted from the 
verification section . 

[0061] According to this configuration, it is possible 
to reliably perform personal identification based on the 
communicating party' s voice of the incoming call between 

15 communication terminal apparatus and thereby prevent 
communication with ill-intentioned people. 
[0062] A second aspect of the voice authentication 
apparatus of the present invention is a voice 
authentication system configured with a communication 

20 terminal apparatus and a remote terminal apparatus, the 
communication terminal apparatus having a specification 
section that specifies the communicating party of an 
incoming call, a transmission section that transmits 
information about the communicating party specified by 

25 the specification section and a received voice of the 
communicating party to the remote terminal apparatus, 
and a reporting section that receives and reports a 
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verification result of the communicating party 
transmitted from the remote terminal apparatus, and the 
remote terminal apparatus having a storage section that 
stores a voice sample for each of a plurality of 
5 communicating parties, a reception section that receives 
information about the communicating party transmitted 
from the communication terminal apparatus and a received 
voice of the communicating party, a verification section 
that reads a corresponding voice sample of the 

10 communicating party from the storage section according 
to the information about the communicating party received 
by the reception section, verifies the received voice 
of the communicating party against the voice sample and 
outputs the verification result, and a transmission. 

15 section that transmits the verification result, output ted 
from the verification section to the communication 
terminal apparatus . 

[0063] According to this configuration, it is possible 
to reliably perform personal identification based on the 

20 communicating party' s voice of the. incoming call between 
communication terminal apparatus and prevent 
communication with ill-intentioned people . Furthermore , 
since the voice sample and voice verification function 
are provided on the remote terminal apparatus side, it 

25 is possible to facilitate upgrade of the voice 
verification function and storage of the voice sample 
and also maintain reduction in size and weight of the 
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communication terminal apparatus without substantially 
using resources of the communication terminal apparatus. 
[0064] A third aspect of the voice authentication 
apparatus of the present invention is a voice 
5 authentication system configured with a communication 
terminal apparatus and a remote terminal apparatus, the 
communication terminal apparatus having a storage section 
that stores a voice sample for each of a plurality of 
communicating parties, a specification section that 

10 specifies the communicating party of an incoming call, 
a transmission section that reads the voice sample of 
the communicating party specified by the specification 
section from the storage section and transmits the voice 
sample together with a received voice of the communicating 

15 party to the remote terminal apparatus, and a reporting 
section that receives and reports a verification result 
of the communicating party transmitted from the remote 
terminal apparatus, and the remote terminal apparatus 
having a reception section that receives the voice sample 

20 of the communicating party transmitted from the 
communication terminal apparatus and a received voice 
of the communicating party, a verification section that 
verifies the received voice of the communicating party 
against the voice sample of the communicating party 

25 received from the reception section and outputs the 
verification result, and a transmission section that 
transmits the verification result outputted from the 
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verification section to the communication terminal 
apparatus . 

[0065] According to this configuration, it is possible 
to reliably perform personal identification based on the 
5 communicating party's voice of the incoming call between 
communication terminal apparatus and prevent 
communication with ill-intentioned people. Furthermore , 
since the voice verification function is provided on the 
remote terminal apparatus side, it is possible to allow 
10 a plurality of communication terminal apparatus users 
to share the voice verification application and 
facilitate upgrade and change of the voice verification 
function . 

[0066] A fourth aspect of the voice authentication system 
15 of the present invention is a voice authentication system 
configured with a communication terminal apparatus and 
a remote terminal apparatus, the communication terminal 
apparatus having a specification section that specifies 
the communicating party of an incoming call, a 
20 transmission section that transmits information about 
the communicating party to the remote terminal apparatus, 
a reception section that receives a voice sample of the 
communicating party transmitted from the remote terminal 
apparatus according to the transmitted information about 
25 the communicating party, a verification section that 
verifies a received voice of the communicating party 
against the voice sample of the communicating party 
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received by the reception section and outputs the 
verification result, and a reporting section that reports 
the verification result outputted from the verification 
section, and the remote terminal apparatus having a 
5 storage section that stores a voice sample for each of 
a plurality of communicating parties, a reception section 
that receives information about the communicating party 
transmitted from the communication terminal apparatus, 
and a transmission section that reads the voice sample 
10 of the communicating party corresponding to the received 
information about the communicating party from the 
storage section and transmits the voice sample to the 
communication terminal apparatus. 

[0067] According to this configuration, it is possible 
15 to reliably perform personal identification based on the 
communicating party's voice of the incoming call between 
communication terminal apparatus and prevent 
communication with ill-intentioned people . Furthermore , 
since the voice data sample storage function is provided 
20 on the remote terminal apparatus side, it is possible 
to allow a plurality of communication terminal apparatus 
users to share the voice sample storage function, and 
thereby making it possible to omit a memory for storing 
the voice sample on the communication terminal apparatus 
25 side and reduce the memory capacity mounted on the 
communication terminal apparatus . 

[0068 ] A fifth aspect of the voice authentication system 
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of the present invention is the voice authentication 
system according to the second aspect of the present 
invention, wherein the transmission section of the 
communication terminal apparatus encrypts the 
5 information to be transmitted to the remote terminal 
apparatus, the reporting section decrypts the encrypted 
information received from the remote terminal apparatus, 
and the reception section of the remote terminal apparatus 
decrypts the encrypted information received from the 
10 remote terminal apparatus and the transmission section 
encrypts information to be transmitted to the 
communication terminal apparatus. 

[0069] According to this configuration, it is possible 
to prevent information content from leaking out to a third 

15 party and improve reliability of the communication system 
applying the voice authentication system. 
[0070] A sixth aspect of the voice authentication system 
of the present invention is the voice authentication 
system according to the second aspect of the present 

20 invention, wherein in the remote terminal apparatus, when 
the voice sample of the communicating party does not match 
the received voice sample as the verification result 
outputted from the verification section, the transmission 
section reports alarm information indicating that the 

25 communicating party is not the person himself /herself 
to an external security organization. 

[0071] According to this configuration, it is possible 
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to make use of the voice authentication system for crime 
prevention and further improve reliability of the 
communication system applying the voice authentication 
system. 

5 [0072] A seventh aspect of the voice authentication 
system of the present invention is the voice 
authentication system according to the second aspect of 
the present invention, wherein in the communication 
terminal apparatus, when the voice sample of the 

10 communicating party does not match the received voice 
sample as the verification result outputted from the 
verification section, the reception section starts 
recording a conversation with the communicating party, 
[0073] According to this configuration, it is possible 

15 to easily use the recorded content as evidence when a 
conversation is a criminal act. 

[0074] An eighth aspect of the voice authentication 
method of the present invention is a voice authentication 
method for authenticating a communicating party's voice 

20 of an incoming call at a communication terminal apparatus, 
having a storing step of storing a voice sample for each 
of a plurality of communicating parties in a storage 
section, a specifying step of specifying the 
communicating party of the incoming call, a verifying 

25 step of reading the voice sample of the specified 
communicating party from the storage section, verifying 
a received voice of the communicating party against the 
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voice sample and outputting the verification result, and 
a reporting step of reporting the verification result. 
[0075] According to this method, it is possible to 
reliably perform personal identification based on the 
5 communicating party' s voice of the incoming call between 
communication terminal apparatus and prevent 
communication with ill-intentioned people. 
[0076] A ninth aspect of the voice authentication method 
of the present invention is a voice authentication method 

10 for authenticating a communicating party's voice of an 
incoming call at a communication terminal apparatus in 
a communication system configured with the communication 
terminal apparatus and a remote terminal apparatus, the 
communication terminal apparatus having a specifying step 

15 of specifying the communicating party of the incoming 
call, a transmitting step of transmitting information 
about the specified communicating party and a received 
voice of the communicating party to the remote terminal 
apparatus and a reporting step of receiving and reporting 

20 a verification result of the communicating party 
transmitted from the remote terminal apparatus, and the 
remote terminal apparatus having a storing step of storing 
a voice sample for each of a plurality of communicating 
parties in a storage section, a receiving step of receiving 

25 information about the communicating party transmitted 
from the communication terminal apparatus and a received 
voice of the communicating party, a verifying step of 
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reading a corresponding voice sample of the communicating 
party from the storage section according to the received 
information about the communicating party, verifying the 
received voice of the communicating party against the 
5 voice sample and outputting the verification result, and 
a transmitting step of transmitting the verification 
result to the communication terminal apparatus . 
[0077] According to this method, it is possible to 
reliably perform personal identification based on the 

10 communicating party' s voice of the incoming call between 
communication terminal apparatus and prevent 
communication with ill-intentioned people . Furthermore, 
since the voice samples and voice verification function 
are provided on the remote terminal apparatus side, it 

15 is possible to facilitate upgrade of the voice 
verification function and storage of the voice sample 
and also maintain reduction in size and weight of the 
communication terminal apparatus without substantially 
using resources of the communication terminal apparatus. 

20 [0078] A tenth aspect of the voice authentication method 
of the present invention is a voice authentication method 
for authenticating a communicating party' s voice of an 
incoming call at a communication terminal apparatus in 
a communication system configured with the communication 

25 terminal apparatus and a remote terminal apparatus, the 
communication terminal apparatus having a storing step 
of storing a voice sample for each of a plurality of 
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communicating parties in a storage section, a specifying 
step of specifying the communicating party of the incoming 
call, a transmitting step of reading the voice sample 
of the specified communicating party from the storage 
5 section and transmitting the voice sample together with 
a received voice of the communicating party to the remote 
terminal apparatus and a reporting step of receiving and 
reporting a verification result of the communicating 
party transmitted from the remote terminal apparatus, 

10 and the remote terminal apparatus having a receiving step 
of receiving a voice sample of the communicating party 
transmitted from the communication terminal apparatus 
and a received voice of the communicating party, a 
verifying step of verifying the received voice of the 

15 communicating party against the received voice sample 
of the communicating party and outputting the 
verification result, and a transmitting step of 
transmitting the verification result to the communication 
terminal apparatus . 

20 [0079] According to this method, it is possible to 
reliably perform personal identification based on the 
communicating party' s voice of the incoming call between 
communication terminal apparatus and prevent 
communication with ill-intentioned people. Furthermore , 

25 since the voice verification function is provided on the 
remote terminal apparatus side, it is possible to allow 
a plurality of communication terminal apparatus users 
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to share a voice verification application and facilitate 
upgrade and change of the voice verification function. 
[0080] An eleventh aspect of the voice authentication 
method of the present invention is a voice authentication 
5 method for authenticating a communicating party' s voice 
of an incoming call at a communication terminal apparatus 
in a communication system configured with the 
communication terminal apparatus and a remote terminal 
apparatus, the communication terminal apparatus having 

10 a specifying step of specifying the communicating party 
of the incoming call, a transmitting step of transmitting 
information about the communicating party to the remote 
terminal apparatus, a receiving step of receiving a voice 
sample of the communicating party transmitted from the 

15 remote terminal apparatus according to the transmitted 
information about the communicating party, a verifying 
step of verifying a received voice of the communicating 
party against the received voice sample of the 
communicating party and outputting the verification 

20 result and a reporting step of reporting the verification 
result, and the remote terminal apparatus having a storing 
step of storing a voice sample for each of a plurality 
of communicating parties in a storage section, a receiving 
step of receiving information about the communicating 

25 party transmitted from the communication terminal 
apparatus and a transmitting step of reading the voice 
sample of the communicating party corresponding to the 



2F04051-PCT 39 

received information about the communicating party and 
transmitting the voice sample to the communication 
terminal apparatus . 

[0081] According to this method, it is possible to 
5 reliably perform personal identification based on the 
communicating party' s voice between communication 
terminal apparatus and prevent communication with 
ill-intentioned people. Furthermore, since the voice 
data sample storage function is provided on the remote 

10 terminal apparatus side, it is possible to allow a 
plurality of communication terminal apparatus users to 
share the voice sample storage function and thereby making 
it possible to omit a memory for storing the voice sample 
on the communication terminal apparatus side and reduce 

15 the memory capacity mounted on the communication terminal 
apparatus • 

[0082] The present application is based on Japanese 
Patent Application No . 2 0 0 3 - 4 2 4 6 2 9 , filed on December 22, 
2003, the entire content of which is expressly 
20 incorporated by reference herein. 

Industrial Applicability 

[0083] The present invention is intended to reliably 
perform personal identification based on a communicating 
25 party'svoiceofan incoming call in a conversation between 
communication terminal apparatus and prevent 
communication with ill-intentioned people. 



